PCluster: Probabilistic Agglomerative Clustering of Gene Expression Profiles
نویسنده
چکیده
A central problem in analysis of gene expression data is clustering of genes with similar expression profiles. In this paper, I describe an hierarchical clustering procedure that is based on simple probabilistic model. This procedure clusters genes with respect to a target classification of conditions, so that genes that are expressed similarly in each group of conditions are clustered together.
منابع مشابه
Dynamic agglomerative clustering of gene expression profiles
The increasing use of microarray technologies is generating a large amount of data that must be processed to extract underlying gene expression patterns. Existing clustering methods could suffer from certain drawbacks. Most methods cannot automatically separate scattered, singleton and mini-cluster genes from other genes. Inclusion of these types of genes into regular clustering processes can i...
متن کاملData Complexity in Clustering Analysis of Gene Microarray Expression Profiles
The increasing application of microarray technology is generating large amounts of high dimensional gene expression data. Genes participating in the same biological process tend to have similar expression patterns, and clustering is one of the most useful and efficient methods for identifying these patterns. Due to the complexity of microarray profiles, there are some limitations in directly ap...
متن کاملTA-clustering: Cluster analysis of gene expression profiles through Temporal Abstractions
This paper describes a new technique for clustering short time series of gene expression data. The technique is a generalization of the template-based clustering and is based on a qualitative representation of profiles which are labelled using trend Temporal Abstractions (TAs); clusters are then dynamically identified on the basis of this qualitative representation. Clustering is performed in a...
متن کاملClustering Time-Series Gene Expression Data with Unequal Time Intervals
Abstract. Clustering gene expression data given in terms of time-series is a challenging problem that imposes its own particular constraints, namely exchanging two or more time points is not possible as it would deliver quite different results, and also it would lead to erroneous biological conclusions. We have focused on issues related to clustering gene expression temporal profiles, and devis...
متن کاملPathCluster: a framework for gene set-based hierarchical clustering
MOTIVATION Gene clustering and gene set-based functional analysis are widely used for the analysis of expression profiles. The development of a comprehensive method jointly combining the two methods would allow for greater biological insights. RESULTS We developed a software package, PathCluster for gene set-based clustering via an agglomerative hierarchical clustering algorithm. The distance...
متن کامل